Robust time borrowing pulse latches

ABSTRACT

Configurable time-borrowing flip-flops may be based on configurable pulse generation circuitry and pulse latches. The circuitry may use a self-timed architecture that controls the width of clock pulses that are generated so that the pulse latches that are controlled by the clock pulses exhibit a reduced risk of race through conditions. Latch circuitry may be provided that is based on a pulse latch and an additional latch connected in series with the pulse latch. In situations in which there is a potential for race through conditions on an integrated circuit, the additional latch may be switched into use to convert the latch circuitry into an edge-triggered flip flop. Clock trees may be provide with configurable shorting structures that help to reduce clock skew. Low-contention clock drivers may drive signals onto the clock tree paths.

This application is a continuation of patent application Ser. No.12/976,752 filed Dec. 22, 2010 now U.S. Pat. No. 8,115,530, which is adivision of patent application Ser. No. 12/060,795, filed Apr. 1, 2008,now U.S. Pat. No. 7,872,512, which are hereby incorporated by referenceherein in their entireties. This application claims the benefit of andclaims priority to patent application Ser. No. 12/976,752 filed Dec. 22,2010, and patent application Ser. No. 12/060,795, filed Apr. 1, 2008,now U.S. Pat. No. 7,872,512.

BACKGROUND

This invention relates to latch circuits in integrated circuits, andmore particularly, to configurable time borrowing latch circuits andassociated circuitry that may be used to help logic designers improvecircuit performance.

Integrated circuits typically contain combinational logic and sequentiallogic. Combinational logic does not include storage elements. The outputof a given combinational logic circuit is therefore determined solely byits present inputs. Sequential logic circuits contain storage elementswith outputs that reflect the past sequence of their input values. As aresult, the output of a sequential circuit is determined by both itspresent inputs and by the data stored in its storage elements.

Commonly-used sequential circuit storage elements includelevel-sensitive latches and flip-flops.

In a level-sensitive latch, the latch output is controlled by the levelof a clock (enable) input. When the clock is high, the latch outputtracks the value of the input. When the clock transitions from high tolow, the output state of the latch is frozen at whatever value waspresent just prior to the transition. So long as the clock is low, theoutput of the latch will be maintained in its frozen state.

Flip-flops are edge-triggered devices that change state on the rising orfalling edge of an enable signal such as a clock. In arising-edge-triggered flip-flop, the flip-flop samples its input stateonly at the rising edge of the clock. This sampled value is thenmaintained until the next rising edge of the clock.

Flip-flop-based logic circuits are often preferred over latch-basedcircuits, because the regularity imposed by the edge-triggeredproperties of flip-flops makes circuit timing behavior relativelystraightforward to model and hence simplifies design.

However, in a conventional flip-flop-based logic circuit, the clockfrequency must generally be slowed down sufficiently to accommodate thedelay associated with the circuit's slowest combinational logic paths.Even if circuitry in a fast logic path produces a valid signal in lesstime than a slow logic path, that signal is not used until the edge ofthe next clock pulse. While the regularity imposed by conventionalflip-flop circuits is beneficial for ease of circuit design, it tends tolimit performance in certain situations.

Time borrowing schemes have been developed to try to address thisproblem. For example, time borrowing schemes have been developed inwhich various delays are provided in the clocks feeding the edgetriggered flip-flops on a circuit. By selecting appropriate delays forthe clocks, a circuit designer can configure a logic circuit so thatflip-flops in slower paths have their clock edges delayed. This allowstime to be borrowed from fast logic paths and provided to slow logicpaths, so that the clock speed for the entire circuit need not be slowedto accommodate worst-case delays.

With these conventional time borrowing schemes, it can be difficult toobtain optimal performance due to the limited number of delays that areavailable from the clock network. Other such schemes for improvingtiming performance may have limited applicability or requireunacceptably complex analysis. For example, time borrowing flip flopshave been developed that provide a fixed and relatively small amount oftime borrowing. These schemes cannot provide optimal performance in manycircuits.

Moreover, conventional time borrowing schemes may be prone to problemsassociated with race conditions and clock timing issues.

It would be desirable to be able to provide improved time borrowingcircuits to optimize circuit performance on integrated circuits such asprogrammable logic devices.

SUMMARY

In accordance with the present invention, pulse latches may be providedthat serve as configurable time borrowing flip flops. The pulse latchesmay be clocked using clock pulses. A self-timed pulse generation circuitmay be provided to generate the clock pulses. When there is a risk ofrace through, the pulse generation circuit generates short clock pulsesthat are able to minimize the risk for race through conditions. Whenthere is no risk of race through, the pulse generation circuit maygenerate a wider clock pulse to support pulse latch time borrowingoperations.

Clock tree structures may be used to distribute clock signals to logiccircuits. For example, two separate paths in a clock tree may be used todistribute respective potentially skewed versions of a common clocksignal to two physically adjacent logic circuits. To prevent clock skew,a configurable clock tree shorting structure may be used to short theclock distribution paths together. Low-contention clock drivers may beused to prevent shorting current from flowing when the clockdistribution paths are shorted together with a configurable clock treeshorting structure.

The pulse latches may be provided with additional latch circuitry. Whenthe pulse latches are used for time borrowing, the additional latchcircuitry may be disabled. In situations where there is a risk of racethrough conditions, the additional latch circuitry may be enabled toconvert the pulse latch into an edge-triggered flip flop.

Further features of the invention, its nature and various advantageswill be more apparent from the accompanying drawings and the followingdetailed description of the preferred embodiments.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a diagram of an illustrative programmable logic deviceintegrated circuit in accordance with an embodiment of the presentinvention.

FIG. 2 is a diagram of a conventional flip-flop.

FIG. 3 is a timing diagram showing definitions for timing constraints ina configurable time borrowing flip-flop in accordance with an embodimentof the present invention.

FIG. 4 is a diagram showing how a worst-case path in a circuit can limitclock frequency in conventional circuits.

FIG. 5 is a diagram showing how the performance of a circuit of the typeshown in FIG. 4 can be improved using configurable time borrowingflip-flops in accordance with an embodiment of the present invention.

FIG. 6 is a diagram of an illustrative system environment in which aprogrammable circuit containing configurable time borrowing flip-flopsmay be used in accordance with an embodiment of the present invention.

FIG. 7 is a flow chart of illustrative steps involved in designing andconfiguring a programmable circuit containing configurable timeborrowing flip-flops in accordance with an embodiment of the presentinvention.

FIG. 8 is a diagram of a configurable time borrowing flip-flop based ona pulsed latch in accordance with an embodiment of the presentinvention.

FIG. 9 is a diagram of an illustrative configurable-width clock pulsethat may be generated in a circuit of the type shown in FIG. 8 inaccordance with an embodiment of the present invention.

FIG. 10 is a diagram showing how there may be a potential for a racethrough condition to arise in a logic circuit when using a configurabletime borrowing flip-flop in accordance with an embodiment of the presentinvention.

FIG. 11 is a diagram of a configurable time borrowing flip-flop based ona pulsed latch in accordance with an embodiment of the presentinvention.

FIG. 12 is a diagram of a self timed pulse generator circuit that may beused to control time borrowing circuits of the type shown in FIG. 11 toreduce the potential for race through in accordance with an embodimentof the present invention.

FIG. 13 is a diagram of a clock tree that may be used on an integratedcircuit such as a programmable logic device integrated circuit todistribute clock signals in accordance with an embodiment of the presentinvention.

FIG. 14 is a diagram of a clock driver circuit of the type that may beused to drive clock signals onto clock paths such as the clock row pathsof FIG. 13 in accordance with an embodiment of the present invention.

FIGS. 15 and 16 are diagrams of illustrative pulse latch circuits thatmay be selectively configured as edge triggered flip-flops in accordancewith an embodiment of the present invention.

FIG. 17 is a timing diagram illustrating the behaviour of signals CLK,LAB_CLK, and PE in accordance with an embodiment of the presentinvention.

FIG. 18 is a timing diagram showing the relationship between signal CLKand other signals in an illustrative timing circuit for a low contentionclock driver in accordance with an embodiment of the present invention.

DETAILED DESCRIPTION

The present invention relates to improving circuit performance insynchronous logic circuits using time borrowing techniques. The logiccircuits may be part of any suitable integrated circuits. For example,the logic circuits and time borrowing circuitry of the present inventionmay be implemented on programmable logic device integrated circuits. Ifdesired, the logic circuits and time borrowing circuitry may beimplemented on programmable integrated circuits that are nottraditionally referred to as programmable logic devices such asmicroprocessors containing programmable logic, digital signal processorscontaining programmable logic, custom integrated circuits containingregions of programmable logic, or other programmable integrated circuitsthat contain programmable logic. The present invention will generally bedescribed in the context of integrated circuits such as programmablelogic device integrated circuits as an example.

An illustrative integrated circuit such as a programmable logic device10 in accordance with the present invention is shown in FIG. 1.

Programmable logic device 10 has input/output circuitry 12 for drivingsignals off of device 10 and for receiving signals from other devicesvia input/output pins 14. Interconnection resources 16 such as globaland local vertical and horizontal conductive lines and buses may be usedto route signals on device 10. Interconnection resources 16 includeconductive lines and programmable connections between respectiveconductive lines and are therefore sometimes referred to as programmableinterconnects 16.

Programmable logic 18 may include combinational and sequential logiccircuitry. The programmable logic 18 may be configured to perform acustom logic function. The programmable interconnects 16 may beconsidered to be a type of programmable logic 18.

Programmable logic device 10 contains programmable memory elements 20.Memory elements 20 can be loaded with configuration data (also calledprogramming data) using pins 14 and input/output circuitry 12. Onceloaded, the memory elements each provide a corresponding static controlsignal that controls the operation of an associated logic component inprogrammable logic 18. In a typical scenario, the outputs of the loadedmemory elements 20 are applied to the gates of metal-oxide-semiconductortransistors in programmable logic 18 to turn certain transistors on oroff and thereby configure the logic in programmable logic 18 and routingpaths. Programmable logic circuit elements that may be controlled inthis way include parts of multiplexers (e.g., multiplexers used forforming routing paths in programmable interconnects 16), look-up tables,logic arrays, AND, OR, NAND, and NOR logic gates, pass gates, etc.

Memory elements 20 may use any suitable volatile and/or non-volatilememory structures such as random-access-memory (RAM) cells, fuses,antifuses, programmable read-only-memory memory cells, mask-programmedand laser-programmed structures, combinations of these structures, etc.Because memory elements 20 are loaded with configuration data duringprogramming, memory elements 20 are sometimes referred to asconfiguration memory, configuration RAM, or programmable memoryelements.

The circuitry of device 10 may be organized using any suitablearchitecture. As an example, the logic of programmable logic device 10may be organized in a series of rows and columns of larger programmablelogic regions each of which contains multiple smaller logic regions. Thesmaller regions may be, for example, regions of logic that are sometimesreferred to as logic elements, each containing a look-up table, one ormore registers, and programmable multiplexer circuitry. The largerregions may be, for example, logic array blocks containing multiplelogic elements. In the example of FIG. 1, illustrative logic elements 22are shown in one of the larger regions of programmable logic 18 in FIG.1 (which may be a logic array block as an example). In a typicalprogrammable logic device 10, there may be hundreds or thousands oflogic elements 22. The four logic elements 22 that are shown in FIG. 1are merely illustrative.

During device programming, configuration data is loaded into device 10that configures the logic elements 22 so that each logic elementperforms a desired logic function on its inputs and producescorresponding output signals at one or more logic element outputs.

The resources of device 10 such as programmable logic 18 may beinterconnected by programmable interconnects 16. Interconnects 16generally include vertical and horizontal conductors. These conductorsmay include global conductive lines that span substantially all ofdevice 10, fractional lines such as half-lines or quarter lines thatspan part of device 10, staggered lines of a particular length (e.g.,sufficient to interconnect several logic array blocks or other suchlogic areas), smaller local lines, or any other suitable interconnectionresource arrangement. If desired, the logic of device 10 may be arrangedin more levels or layers in which multiple large regions areinterconnected to form still larger portions of logic. Still otherdevice arrangements may use logic that is not arranged in rows andcolumns.

In addition to the relatively large blocks of programmable logic thatare shown in FIG. 1, the device 10 generally also includes someprogrammable logic associated with the programmable interconnects,memory, and input-output circuitry on device 10. For example,input-output circuitry 12 may contain programmable input and outputbuffers. Interconnects 16 may be programmed to route signals to adesired destination.

Much of the logic circuitry on device 10 contains storage elementsreferred to as flip-flops. Flip-flops are edge triggered devices thatcan store digital data (e.g., a logic one or a logic zero).

A conventional flip-flop 24 is shown in FIG. 2. Flip-flop 24 has a datainput line 30 and a data output line 32. Level sensitive latches 26 and28 are used to store data. Latches 26 and 28 are clocked by clock signalCLOCK at clock input line 34. Latch 26 has an inverting clock input 62.Latch 28 has a non-inverting clock input 64. Because input 62 isinverted with respect to input 64, latches 26 and 28 are effectivelyclocked with clock signals that are 180° out of phase. Latch 26 respondsto an inverted clock signal and latch 28 responds to noninverted clocksignal CLOCK.

Conventional flip-flops such as flip-flop 24 of FIG. 2 are characterizedby a setup time Tsu, a hold time Th, and a clock-to-output time Tco.

The setup time Tsu of a flip-flop represents the minimum amount of timethat data D on input line 30 must be valid before the signal CLOCK goeshigh on clock input 34 and its inverse goes low at the input 62 of latch26. If data is presented to input 30 at a time that is closer in time tothe clock edge than Tsu, proper operation of the flip-flop 24 cannot beguaranteed.

The hold time Th relates to the behavior of latch 26 and represents theminimum amount of time that the data D is required to be stable afterthe CLOCK edge (i.e., after the inverse of CLOCK falls). If D changesstate before Th is complete, the output Q of the flip-flop may not becorrect. The clock-to-output time Tco represents the amount of time theflip-flop takes to change its output after the clock edge. Theserelationships are illustrated in the timing diagram of FIG. 3.

Although the data D that is presented to a flip-flop's input must bestable at least Tsu before the clock edge, there is no prohibition ondata arriving at earlier times. The amount of time before the clock edgethat stable data is presented at the input to a flip-flop may bereferred to as Td. The variable Tcq is sometimes used to represent theamount of time after the edge of the clock signal CLK that output data Qis stable for a given value of Td. In some situations, data may bepresented very close in time to the clock edge (e.g., close to or at theminimum established by the setup time Tsu). In this type of situation,Td will equal Tsu (or will be very close to Tsu). In other situations,data may be presented farther from the clock edge. In this type ofsituation, Td will be greater than Tsu.

The values of Td and Tcq are interrelated. The variable Td can beadjusted, while the variable Tcq is a function of Td. At Td values thatare greater than or equal to Tsu, the flip-flop is expected to workproperly, with a Tcq value of Tco (i.e., Tcq will be equal to theminimum amount of time required for the output to become valid after theclock edge—Tco). At Td values below Tsu, however, the setup timerequirements of the flip-flop are violated because the data is arrivingtoo close to the clock edge. In this situation, no valid data will bepresented to the output of the flip-flop and the flip-flop will notperform properly.

With conventional flip-flops of the type shown in FIG. 2, time borrowingis not possible, which limits circuit performance. Consider, as anexample, the illustrative circuit 74 of FIG. 4. In circuit 74, data isprocessed in a loop formed from flip-flop 76, logic circuit 78,flip-flop 80, and logic circuit 82.

In the example of FIG. 4, flip-flops 76 and 80 are conventionalflip-flops without time borrowing capabilities. As a result, flip-flops76 and 80 have fixed values of Tsu and Tco. In the FIG. 4 example, thevalue of Tsu for flip-flops 76 and 80 is 200 ps and the value of Tco forflip-flops 76 and 80 is 100 ps. As shown in FIG. 4, logic circuit 78 (inthis example) has an associated processing delay of 1000 ps. Logiccircuit 82 has an associated delay of 800 ps. Output signals from logic82 are fed back to logic circuit 78 via feedback path 84.

In circuits such as circuit 74, the maximum clock frequency that can beused is dictated by the worst-case path delay. There are two relevantpath delays to consider in circuit 74. The delay of the first path isassociated with the path from point 66 to point 68. Note that althoughTsu and Tco are associated with the internal logic of the flip-flop,they are considered to be part of the path in this discussion and areillustrated as being outside of the flip-flop. The delay of the firstpath is equal to 1300 ps (100 ps+1000 ps+200 ps). The delay of thesecond path is associated with the path from point 70 to point 72. Thisdelay is equal to 1100 ps (100 ps+800 ps+200 ps). Because the delayassociated with the slowest path is the limiting factor in determiningthe maximum clock speed, the maximum clock speed that can be used inclocking circuit 74 is governed by the path delay of 1300 ps that isassociated with the path from point 66 to point 68. Even though thedelay associated with the path from point 70 to point 72 is lower (1100ps), this smaller delay cannot be exploited using conventionalflip-flops. As a result, it is necessary to set the clock rate forcircuit 74 to a frequency of about 1/(1300 ps) or 769 MHz.

In accordance with the present invention, pulse latches and othercircuitry may serve as time borrowing flip flops. In arrangements inwhich time borrowing latch circuits (time-borrowing flip-flops) areused, circuit performance can be improved by reducing the delayassociated with worst-case paths. Consider, as an example, the situationin which time-borrowing flip-flops are used in circuit 74.

The use of time-borrowing flip-flops allows the clock speed ofillustrative circuit 74 of FIG. 4 to be increased. Consider, as anexample, the arrangement of FIG. 5. In FIG. 5, circuit 74 is usingconfigurable time-borrowing flip-flops 86-1 and 86-2 in place ofconventional flip-flops 76 and 80 of FIG. 4. In the FIG. 5 example,flip-flop 86-1 is operated with a Tsu value of 200 ps and a Tco value of100 ps, as with flip-flop 76 of FIG. 4. Flip-flop 86-2 of FIG. 5operates with a Tsu value of 100 ps and a Tco value of 200 ps. Bothflip-flop timing environments result in an identical Tdq value (300 ps).Nevertheless, operating flip-flop 86-2 with a Tsu value of 100 psresults in an improved balance for the delays associated with the pathsin circuit 74.

In particular, the delay associated with the path between point 66 and68 is reduced to 1200 ps (in the FIG. 5 arrangement) from 1300 ps (inthe FIG. 4 arrangement), whereas the delay associated with the pathbetween point 70 and 72 is increased to 1200 ps (in the FIG. 5arrangement) from 1100 ps (in the FIG. 4 arrangement). Although one pathis made slower than before, the other path is made faster and, onbalance, the entire circuit 74 of FIG. 5 can operate more rapidly. Thisis because the maximum clock speed of the clock for circuit 74 isdictated by the worst case delay. In the example of FIG. 4, the maximumclock frequency for circuit 74 was 1/(1300 ps). In the example of FIG.5, the maximum clock frequency for circuit 74 has been increased to1/(1200 ps) or 833 MHz. This improvement in circuit performance is theresult of time borrowing by flip-flop 86-2.

When configurable time-borrowing flip-flops are used, the amount of timeborrowing that is associated with a given time borrowing flip flop maybe adjusted by adjusting configurable delay elements associated with theflip-flops. The settings of the delay elements may, for example,configure the pulse width at the input to a pulse latch circuit thatserves as a time-borrowing flip flop. Detailed knowledge of whichportions of a circuit can tolerate larger delay element values and whichportions of a circuit require relatively low delays can be providedusing computer-aided design (CAD) circuit design tools.

An illustrative system environment for integrated circuits such asprogrammable integrated circuits that contain configurable timeborrowing flip-flops is shown in FIG. 6. Programmable logic device 10(or any other suitable integrated circuit containing time-borrowingflip-flop circuitry) may be mounted on a board 36 in a system 38.Programmable logic device 10 may receive configuration data fromprogramming equipment or from any other suitable equipment or device.

In the example of FIG. 6, programmable logic device 10 is the type ofprogrammable logic device that receives configuration data from anassociated integrated circuit 40. With this type of arrangement, circuit40 may, if desired, be mounted on the same board 36 as programmablelogic device 10. The circuit 40 may be a programmable logic deviceconfiguration data loading chip that loads configuration data intoprogrammable logic device memory from an external erasable-programmableread-only memory (EPROM) chip, a programmable logic device configurationdata loading chip with built-in memory (sometimes referred to as aconfiguration device), or any other suitable device. When system 38boots up (or at another suitable time), the configuration data forconfiguring the programmable logic device may be supplied to theprogrammable logic device from device 40, as shown schematically by path42. The configuration data that is supplied to the programmable logicdevice may be stored in the programmable logic device in itsconfiguration random-access-memory elements 20.

System 38 may include processing circuits 44, storage 46, and othersystem components 48 that communicate with device 10. The components ofsystem 38 may be located on one or more boards such as board 36 or othersuitable mounting structures or housings.

Communication paths may be used to interconnect device 10 to othercomponents. For example, communication path 37 may be used to conveydata between an integrated circuit 39 that is mounted on board 36 andprogrammable logic device 10. Communication paths 35 and 50 can be usedto convey signals between programmable logic device 10 and components44, 46, and 48.

Configuration device 40 may be supplied with the configuration data fordevice 10 over a path such as path 52. Configuration device 40 may, forexample, receive the configuration data from configuration data loadingequipment 54 or other suitable equipment that stores this data inconfiguration device 40. Device 40 may be loaded with data before orafter installation on board 36.

It can be a significant undertaking to design and implement a desiredlogic circuit in a programmable logic device. Logic designers thereforegenerally use logic design systems based on computer-aided-design (CAD)tools to assist them in designing circuits. A logic design system canhelp a logic designer design and test complex circuits for a system.When a design is complete, the logic design system may be used togenerate configuration data for electrically programming the appropriateprogrammable logic device.

As shown in FIG. 6, the configuration data produced by a logic designsystem 56 may be provided to equipment 54 over a path such as path 58.The equipment 54 provides the configuration data to device 40, so thatdevice 40 can later provide this configuration data to the programmablelogic device 10 over path 42. System 56 may be based on one or morecomputers and one or more software programs. In general, software anddata may be stored on any computer-readable medium (storage) in system56 and is shown schematically as storage 60 in FIG. 14.

In a typical scenario, logic design system 56 is used by a logicdesigner to create a custom circuit design. The system 56 producescorresponding configuration data which is provided to configurationdevice 40. Upon power-up, configuration device 40 and data loadingcircuitry on programmable logic device 10 is used to load theconfiguration data into the programmable memory elements 20 of device10. Device 10 may then be used in normal operation of system 38.

Illustrative steps involved in configuring and using a programmablelogic device 10 that contains configurable time-borrowing flip-flops areshown in FIG. 7.

At step 122, a user (e.g., a logic designer) creates a custom logicdesign for a programmable logic device 10 using logic design system 56(FIG. 6). The user provides computer aided design (CAD) tools on logicdesign system 56 (FIG. 6) with information on the custom logic design.The input data may be in the form of design data (e.g., specificationsfor a logic circuit or other circuit) and constraint data (e.g.,settings that set limits on acceptable circuit performance and settingsthat control how the computer aided design tools 12 should process thedesigner's input). As an example, step 122 may involve the constructionand entry of a logic circuit design to be implemented in a particulartype of programmable logic device with certain size and timingconstraints. The tools used to enter the logic circuit input datainclude tools such as text editors, floorplan editors, constrainteditors, library-based and wizard-based design and constraint entryaids, etc. As another example, the design and constraint data entered atstep 122 may relate to a circuit design that is to be implemented as anapplication specific integrated circuit (ASIC). In this context, atypical computer aided design tool 12 for accepting design andconstraint data is a place-and-route tool. In general, any suitableformat may be used for design and constraint data. Constraint data mayinclude information on any suitable constraints (e.g., design objectplacement constraints, timing constraints, area constraints, noise levelconstraints, power consumption constraints, thermal constraints, etc.)Design and constraint data may be stored in databases on system 56(storage 60).

At step 124, computer aided design tools on system 56 may be run toimplement the circuit design. If, for example, the design is a logicdesign for a particular type of programmable logic device integratedcircuit, CAD tools may be used to implement the logic design in a targettechnology. The CAD tools may include a logic compiler and otherprograms of the type that are typically provided as a suite of logicdesign and analysis tools. Programming data (e.g., in a .pof file) orother suitable output data representing the implemented design may begenerated. If the design is a circuit design for an application specificintegrated circuit, an application specific integrated circuitplace-and-route tool and/or other suitable tools may be used to generatea series of lithographic mask specifications for fabricating the circuitas a silicon chip.

During step 124, analysis operations are performed using analysis toolssuch as timing analysis tools. The timing analysis performed during step124 allows logic design system 56 to identify optimum time borrowingsettings for each flip-flop and its configurable delay elementcircuitry. With one suitable arrangement, all flip-flops are firstinitialized to a minimum time borrowing setting. The timing analysistools may then assign successively larger time borrowing settings. Thisprocess may be iterated until it converges. During a verificationoperation, the timing analysis tools can check that setup timeconstraints have been satisfied and that minimum expected data arrivaltimes (for data signals D) do not violate the hold time requirements forany of the flip-flops. Once verification is complete, the CAD tools canincrease the time borrowing setting for each flip-flop that cannot meetsetup time constraints. The configuration data that is generated at step124 includes configuration data for adjusting the configuration delayelements (pulse latch pulse generation circuitry) of each flip-flop toan appropriate setting.

Once an optimum implementation of the custom circuit design has beenidentified and configuration data for that design has been generated atstep 124, the configuration data can be loaded into the programmablelogic device 10 (step 126), as described in connection with FIG. 6. Theloaded data configures the time borrowing flip-flops so that clockspeeds can be made faster than would otherwise be possible.

If desired, time-borrowing flip-flops may be implemented usingconfigurable pulsed latches. Configurable pulsed latch arrangements useslatching circuitry that is transparent (i.e., output tracking input) fora configurable amount of time Tsk to serve as a configurable timeborrowing flip-flop.

An illustrative embodiment of a configurable time borrowing flip-flop 86based on a pulsed latch is shown in FIG. 8. Flip-flop 86 has aconfigurable pulse generation circuit 87 that is based on a configurabledelay circuit 94. Clock signal CLK is provided on clock input line 92and is distributed to delay element 106 via input 112. Delay element 106produces four different delays (in this illustrative example). The clocksignal on input 112 is therefore delayed by four corresponding amountsat the outputs 110 of delay element 106. Multiplexer 108, which isadjusted by the states of programmable elements 20, provides a selectedone of the delayed versions of the clock signal on line 134 through aninverting output. Logic AND gate 130 receives the delayed clock signalon line 134 and the original clock signal CLK on line 92 and produces apulse CP of an adjustable width Tpw on its output 132. The delayed clocksignal on line 134 is inverted by the inverting output of multiplexer108, which may be regarded as a phase delay of 180°. An illustrativeclock pulse CP is shown in FIG. 9. The width Tpw is determined by theamount of delay selected by multiplexer 108.

The clock pulse CP is provided to the clock input CL of level-sensitivelatch 128. Latch 128 receives data D from flip-flop input line 88 andproduces corresponding output data Q on flip-flop output line 90. Usingconfigurable pulse generation circuitry 87, the pulse width Tpw can beset to a value that maintains the latch 128 in its transparent state fora time appropriate to implement a desired amount of time borrowing.

It may be advantageous to use configurable time borrowing flip-flops inlogic elements 22 (FIG. 1). A typical logic element may have a look-uptable that supplies a data output to path 88 of flip-flop 86. Ifdesired, data may be supplied to path 88 from other sources (e.g., fromrouting lines in a logic array block, etc.).

Pulse latches such as the pulse latch of FIG. 8 may allow time borrowingand thereby improve performance of a given logic circuit on aprogrammable logic device. However, short paths on a device may causehold time issues. For example, short paths have the potential to lead tohold time violations in devices that have been fabricated underparticular process conditions.

As described in connection with FIG. 8, each time-borrowing pulse latchcircuit 86 may be implemented as a level sensitive latch to maximizespeed. For proper operation of circuit 86 of FIG. 8, the pulse of FIG. 9must be wide enough to latch data. However in some situations in whichthe pulse is wider than its minimum setting and the data path is fasterthan usual, a race through condition may arise. The race through hazardfor pulse latches with short data path delay is illustrated in FIG. 10.

In FIG. 10, a clock signal CLK is provided to path 140. Delay circuit 87produces a corresponding clock pulse CP on path 142. Clock pulsedistribution path 144 may be used to distribute clock pulse CP tomultiple time-borrowing flip flops (i.e., pulse latches A and B). Onlylow-to-high transitions on clock signal CLK create corresponding CPpulses, so latches A and B exhibit characteristics of positive edgetriggered devices.

Each latch may be contained in a respective logic element (as anexample). Latch A has a data input connected to input path 146. Latch Bhas a data input connected to input path 154. When clock pulse CParrives at the clock input to latch A, latch A latches data signal D andtransmits a corresponding output data signal Q through output path 148,through a short routing path such as local line 150, and through logiccircuit 152 (e.g., a look-up table). Because path 148, local line 150,and logic circuit 152 produce only a small amount of delay, the outputsignal from latch A may be received at input 154 to latch B before theclock pulse CP ends. As a result, the data signal D at the input tolatch B may be erroneously changing while pulse signal CP is high. Ifdata signal D at the input to latch B were to transition only after CPhad fallen, there would not be a race through condition and output Q oflatch B would be valid. However, when CP is still high when the input Dto latch B is changing, the output of latch A will affect latch B beforethe next cycle of clock signal CLK is reached. Circuits that exhibitrace through in this way will not function properly and should beavoided.

In accordance with embodiments of the present invention, race througherrors such as these can be avoided by providing a pulse generationcircuit 87 that generates an appropriate minimum pulse width for clockpulse CP and by providing a fail-safe mode of operation in which thepulse latches can be operated as full edge-triggered flip-flops.Structures may also be provided that configurably reduce the skewbetween physically local but electrically distant parts of a clockdistribution network.

An example of a suitable pulse latch that may be used to supportconfigurable time borrowing functions in an integrated circuit such asprogrammable logic device integrated circuit 10 is shown in FIG. 11. Asshown in FIG. 11, pulse latch 86 may have an input 156 that receives alogic signal. The logic signal that is received on input 156 may be, forexample, a combinational function z0(a,b,c0,d0,e,f) from the output oflook-up table 157 (as an example). If desired, pulse latch 86 may alsoreceive a logic signal sdata on input path 160. Complementarymetal-oxide-semiconductor (CMOS) pass gates 162 that are controlled byconfiguration random-access memory cells or dynamic control signals(e.g., control signals from core logic on device 10) may be used toselectively route signals from paths 156 and 160 into level sensitivelatch 164.

If desired, pass transistor T may be turned on by suitable programmingof an associated configuration random-access-memory cell 20. When passtransistor T is off, data may be routed into latch 164. When it isdesired to bypass latch 164, latch 164 may be disabled and configurationrandom-access memory cell 20 may be used to turn transistor T on. Inthis configuration, data may be driven onto output 158 (and connectedrouting lines) via output buffer 166. Transistor T may be, for example,an n-channel metal-oxide-semiconductor pass gate formed using a thickoxide low power design. When transistor T is disabled and levelsensitive latch 164 is enabled, pulse latch 86 may latch eitherz0(a,b,c0,d0,e,f) or sdata.

Pulse latch 86 may be clocked by a pulse generation circuit producingclock pulse CP and inverted clock pulse NCP. These clock pulse signalsmay be applied to terminals 168 of CMOS pass gates 170 as shown in FIG.11. Line 172 at the input of NAND gate 174 may receive a clear controlsignal/CLR. This signal may be used to asynchronously clear pulse latch86. When signal/CLR is low, latch 86 is placed in its normal pulse latchmode.

During normal operation, pulse latches such as latch 86 of FIG. 11 andother suitable pulse latch circuits may be controlled by pulses CP andNCP. In accordance with embodiments of the present invention, pulsegenerator circuitry may be provided that creates clock pulses CP and NCPin a way that avoids race through conditions. In particular, pulsegenerator circuitry may be provided that closely approximates theminimum delay permissible for clock pulses CP and NCP (i.e., thesmallest clock pulse widths that will cause level sensitive latch 164 tocorrectly latch data during operation). In practice, the optimum pulsewidth that is produced by the pulse generator circuitry will tend to beslightly larger than the absolute minimum width required to latch data.This is because a small additional delay above the theoretical minimumpulse width helps to accommodate noise and on-die variations. Excessamounts of delay above this small additional delay will, however,increase the risk of race-through.

Illustrative pulse generator circuitry 176 in accordance with theinvention is shown in FIG. 12. In a typical arrangement, there may beone or two circuits such as circuit 176 per logic array block on device10. This is, however, merely illustrative. If desired, there may be morecircuits 176 or fewer circuits 176 on a given device.

Clock signal CLK may be received at input line 178 to NAND gate NAND2.The NAND gate NAND2 may also receive a clock enable signal CE and apulse enable signal PE on respective inputs 180 and 182. The output ofNAND gate NAND2 is connected to the input of inverter INV3. The outputof inverter INV3 produces the clock pulse signal LAB_CLK on path 184. Asshown schematically by LAB line parasitic capacitance 192, there may bea nonnegligible amount of parasitic capacitance associated with drivingpath 184, as it may be used to drive many logic elements and flip-flops,only one of which is shown. As a result, inverter INV3 may beconstructed using large transistors, as illustrated schematically inFIG. 12 by the relatively large size of inverter INV3.

The clock pulse signal LAB_CLK may be provided to the input of logiccircuitry 186. Circuitry 186 may include optional multiplexer circuitryfor allowing logic elements to select a desired LAB_CLK signal to use asits clock pulse CP or, following inversion, as its inverted clock pulseNCP. There may be, for example, two or more possible LAB_CLK signals ina given region of logic from which to choose (e.g., in multiclockarchitectures). Circuitry 186 may include inverter circuitry to use inproducing inverted clock pulse NCP. As shown in FIG. 12, clock pulsesignals CP and NCP may be provided at respective outputs 188 and 190 oflogic 186.

The clock pulse signal LAB_CLK may be provided to the input of inverterINV2. Output signals from inverter INV2 may be routed to one of the twoinputs to NAND gate NAND1. The other input of NAND gate NAND1 mayreceive signal PE from node 196. A configurable amount of delay may beprovided using one or more delay elements such as delay element 106. Inthe example of FIG. 12, delay element 106 is formed from multipleseries-connected inverters. Multiplexer 108 may be controlled by theconfiguration data loaded into an associated configuration random-accessmemory cell 20. The output from cell 20 may be supplied to the controlinput of multiplexer 108 over path 198. Multiplexer 108 has two inputs,so the illustrative pulse generation circuit 176 of FIG. 12 has twodelay settings (high and low). In general, any suitable number of delaysettings may be provided. (In the example of FIG. 8, delay circuit 94has four associated delay settings.)

Multiplexer M1 receives the output of multiplexer 108 at its “1” inputand receives positive power supply voltage Vcc at its “0” input. Theoutput of multiplexer M1 is provided to node 196. Multiplexer M1 has acontrol input 200 that determines its state. Control input 200 isconnected to line 178 and receives clock signal CLK. When clock CLK ishigh, the output of multiplexer 108 is routed to node 196. When clockCLK is low, multiplexer M1 is configured to route the high voltage onits “0” input to node 196.

When race through is not a problem in a given circuit, a logic designermay load configuration random-access-memory cell 20 of FIG. 12 with alogic zero. This switches the delay of delay element 106 into use andmakes the time borrowing feature of pulse latch 86 (FIG. 11) active. Ifdesired, additional levels of pulse delay adjustment may be provided(e.g., by using a multiplexer 108 with more inputs).

When race through is a problem in a given circuit, a logic designer willopt not to use time borrowing and will load configurationrandom-access-memory cell 20 of FIG. 12 with a logic one. In thissituation, the clock pulses CP and NCP that are produced by circuit 176will have the minimum width suitable for operating latch 86 (FIG. 11)satisfactorily. The characteristics of NAND gate NAND1, inverter INV1,and multiplexer 108 are preferably chosen to replicate as closely aspossible the delay of pulse latch 86 (FIG. 11), so circuit 176 may besaid to be “self-timed.”

The operation of circuit 176 when no time borrowing is being used may beunderstood as follows. Initially, clock signal CLK is low, somultiplexer M1 routes the high voltage on its “0” input to node 196.This takes signal PE high. Clock enable signal CE is initially high.Because CLK is low, the output of NAND gate NAND2 is high. With theoutput of NAND gate NAND2 high, the output of inverter INV3 (i.e., clockpulse signal LAB_CLK) is low (as is associated signal CP). With LAB_CLKlow, the output of inverter INV2 is high. Because signal PE is high andthe output of inverter INV2 is high, both inputs to NAND gate NAND1 arehigh. As a result, the output of NAND gate NAND1 is low. Inverter INV1inverts the low output from NAND gate NAND1 and produces a high signalat its output (node 202). No time borrowing is being used, soconfiguration random-access-memory cell 20 of FIG. 12 is loaded with alogic one and directs multiplexer 108 to connect its “1” input to itsoutput. Because clock CLK is low, the control input to multiplexer M1 online 200 is low and multiplexer M1 connects the high signal on its “0”input to node 196, maintaining signal PE at its high value.

When clock signal CLK transitions from low to high, the control input tomultiplexer M1 on line 200 goes high. As a result, multiplexer M1switches to its “1” input and passes the high input from the output ofinverter INV1 at M1's “1” input to its output. There is thus noimmediate change in the state of signal PE.

At this stage, clock enable signal CE is high and signal PE is high.Accordingly, the output of NAND gate NAND2 goes low when clock signalCLK goes high. Inverter INV3 inverts the low signal at the output ofNAND2 and produces a high LAB_CLK signal. Inverter INV2 inverts thishigh signal, so that the output of inverter INV2 goes low. NAND gateNAND1 has a high input from signal PE. When NAND gate NAND1 receives thelow signal from inverter INV2, the output of NAND gate NAND1 goes high.Inverter INV1 receives the high output from NAND gate NAND1 and takesits output low. Because the output of the configurationrandom-access-memory cell on line 198 is high, the low output ofinverter INV1 passes through the “1” input of multiplexer 108 to the “1”input of multiplexer M1. Clock signal CLK is high, so multiplexer M1 isconfigured to route the low signal on its “1” input to node 196. Thistakes signal PE low.

As a result of the operation of the pulse generation circuit, a signalPE is generated that is an inverted and delayed version of signal CLKand a pulse-shaped signal LAB_CLK is generated on path 184. SignalLAB_CLK goes high (after a short delay) whenever clock signal CLK has arising edge and goes low (after a short delay) shortly after the risingedge of CLK, with a delay determined by the amount of time required forLAB_CLK to rise together with any additional delay from delay chain 106.Signal LAB_CLK (and signals CP and NCP which are true and complementaryversions of LAB_CLK) therefore may be characterized by a pulse shape ofthe type shown in FIG. 9. A timing diagram illustrating the behaviour ofsignals CLK, LAB_CLK, and PE is shown in FIG. 17.

The design of pulse generation circuit 176 ensures that the width ofpulse LAB_CLK (and the widths of associated pulses CP and NCP) can be assmall as possible while being long enough to allow level sensitive latch86 to operate properly. This is because NAND gate NAND1, inverter INV1,and multiplexer 108 of FIG. 12 create just enough delay to operate latch86 (FIG. 11) properly.

The design of pulse generation circuit 176 also produces accurate pulsesthat are not subject to excessive changes due to fabrication processvariations. If process variations are not well controlled, variations inthe strength of inverter INV3 might adversely affect pulse widthaccuracy. However, the arrangement of pulse generation circuit 176 inFIG. 12 compensates for process variations. For example, if processvariations cause inverter INV3 to be stronger than its nominal value,inverter INV3 will cause clock pulse LAB_CLK to rise too early. However,this will cause signal PE to reset to a low value early. The earlytransition of signal PE from high to low will cause the falling edge ofclock pulse LAB_CLK to transition early to compensate. Similarly, ifinverter INV3 is weak due to fabrication variations, the signal LAB_CLKwill tend to rise slowly. This, in turn, will cause signal PE totransition later than usual and will cause LAB_CLK to fall later thanusual. As a result of this compensating circuit behaviour, changes inthe width of LAB_CLK (and pulses CP and NCP) due to integrated circuitmanufacturing process variations are minimized.

Some integrated circuits are subject to potential race throughconditions because of hold time problems arising from clock skew betweenlatches. This type of situation may occur when latches are physicallyclose to each other but are electrically far apart. The clock signalsthat feed the latches may develop skew. For example, two logic arrayblocks may be adjacent to each other, but may be located in differentbranches of a clock tree. The clock tree may be, for example, anH-shaped clock tree of the type sometimes referred to as an H-tree.

In integrated circuits such as programmable logic device integratedcircuits, a number of H-trees may be provided to supply global clocks.Configurable multiplexers or other programmable routing circuits may beused to route the clock signals to desired locations within the trees.For example, a configurable multiplexer may be used to select amongsubset of the clocks at lower levels in a tree (e.g., to select a rowclock (ROWCLK) in a programmable logic device).

In situations such as these, the relative delay in receiving data withtwo different latches might be small, but the clock skew between the twolatches might be large. This can lead to an undesirable race throughcondition. To address clock skew of this type, integrated circuits(e.g., high speed circuits such as processors and programmable logicdevices) may be provided with configurable clock tree shortingstructures. The configurable clock tree shorting structures may beconfigured using memory elements such as memory elements 20.

FIG. 13 shows an illustrative clock tree 210 with configurable clocktree shorting structures. As shown in FIG. 13, clock tree 210 mayreceive a clock from a clock source at an input such as input 212.Buffers such as buffer 214 may be used to drive the clock onto theconductors of clock tree 210. The clock tree is arranged to divide clocksignals onto paths that are of relatively equal length. This helps toequalize the clock signal distribution path length that is experiencedby circuits on the integrated circuit (e.g., on a programmable logicdevice 10). For example, at node 216, clock signals may be split onto afirst path that conveys clock signals to node 228 in direction 220 andsecond path that conveys clock signals to node 222 in direction 218. Atnodes 228 and 222, the clock signals are again split, giving rise to theH-clock shape of clock tree 210. In particular, at node 228, the clocksignal may be split into a first path that conveys the clock indirection 230 and a second path that conveys the clock in direction 232and at node 222 the clock signal may be split into a first path thatconveys the clock in direction 224 and a second path that conveys theclock in direction 226.

The clock signal may be routed to logic circuits in rows such as row234. Multiplexers or other suitable switching structures may be used toconvey the clock onto the clock conductors of each row. In the exampleof FIG. 13, multiplexer 240 routes row clock signals onto left-hand path242 and multiplexer 236 routes clock signals onto right-hand path 238.

The delay differences for the clock signal from input 212 at the centerof a particular row may cause skew between the version of the clock onthe left half of the row and the version of the clock on the right halfof the row. To reduce or eliminate this skew, a logic designer can loadprogramming data into appropriate memory elements 20. The outputs ofthese memory elements may be provided to the gates of NMOS passtransistors or other suitable configurable clock tree shortingstructures. The configuration data in the memory elements may cause thepass transistors to selectively short various clock path conductorstogether. For example, a memory element 20 may produce a control signalon its output (e.g., a logic high) that turns on transistor 246 andthereby shorts lines 242 and 248 together in row 234. This reduces clockskew between regions of logic such as logic elements 252 and 254, sothat logic elements 252 and 254 function properly at high clockfrequencies.

The low resistance connection between respective branches of an H-treeor other clock distribution network that is provided by transistors suchas transistor 246 is preferably generated when the branches are beingused to convey respective versions of the same clock signal (e.g., aclock signal CL). In a programmable logic device integrated circuit orother integrated circuit with configurable clocking structures, twoclock tree branches may sometimes be used to convey distinct signals. Inthis situation, the memory element 20 for the appropriate configurableclock tree shorting structure (e.g., transistor 246 in the FIG. 13example) may sometimes be used to produce a control signal thatconfigures the configurable clock tree shorting structure to produce anopen circuit between its terminals. For example, transistor 246 may beturned off so that an open circuit is produced between its respectivesource-drain terminals. Shorting structures such as transistor 246 arepreferably only used to short clock paths together when the clock pathscarry the same clock signal.

The configurable shorting connections are preferably arranged such thatany time the same clock signal is routed to circuits with potentiallylarge clock skew (i.e., potentially large clock skew LABs), aconfigurable shorting connection can be enabled. With one suitabletechnique, a configurable shorting connection is provided between everypossible pair of row clocks (rowclks) on the left and right of acircuit.

A less complex arrangement involves ensuring that every clock in theclock tree (e.g., the H-tree) can be routed onto every row as a rowclk.With this arrangement, it may be possible to guarantee that those clocksthat are used on both the left and right can be routed onto the clockpaths in a row that have available shorting structures. Only a singleconfigurable shorting connection need be provided in each row in thistype of configuration.

As an example, if there were six rowclks in a given device and it wasnecessary to provide a connection between every possible pair betweenthe left and right branch, 21 connections might be required. Ifsufficient routing flexibility is provided, whenever a clock is used onboth left and right, it will use the same rowclk. In this situation,only six connections need to be provided. As this example demonstrates,flexible clock routing arrangements tend to reduce the number ofconfigurable clock tree shorting connections that are required (e.g.,from 21 to 6), but may increase the required capacity for the H-tree torowclk multiplexers such as multiplexers 240 and 236 of FIG. 13.

If desired, conventional buffers may be used to drive the rowclks (i.e.,the clocks on paths such as paths 242 and 238). However, this type ofarrangement may lead to some short circuit current if there is a periodwhen an early rowclk that is driving a different value onto a row than alate rowclk. To avoid contention of this type, it may be desirable touse low contention buffers when driving clocks onto the clock paths thatare to be selectively shorted together. Possible locations for the clockdistribution buffers are indicated by nodes 256 and 258 (as an example).Clock distribution buffers (drivers) may be located at any suitablelocations within clock tree 210 if desired.

A low-contention clock distribution driver may include drivingtransistors with fast disable and delayed enable characteristics. Anillustrative circuit for this type of low-contention clock driver isshown in FIG. 14. As shown in FIG. 14, clock driver 260 may haveparallel inverters 266 and 270. A clock signal CLK may be received atinput 272 and a corresponding strengthened version of the clock signalmay be provided at output 274. In a typical scenario, output 274 may beconnected to a clock distribution path such as one of the row paths inFIG. 13 (as an example). In this type of environment, the low contentioncapabilities of driver 260 are used to prevent short circuit currentsfrom developing in those rows in which the left and right halves of theclock distribution path have been shorted together using a passtransistor such as pass transistor 246. In general, however, driver 260may be connected in any suitable portion of a clock distribution networkon an integrated circuit.

Inverters 266 and 270 (sometimes referred to as buffers, drivers, drivecircuitry, etc.) have their outputs connected to output line 274. Apositive power supply voltage (e.g., Vcc) may be applied to positivepower supply lines 262 and a ground power supply voltage (e.g., Vss of 0volts) may be applied to ground power supply terminals 264. Inverter 266may be driven directly from clock signal input line 272. Inverter 270may be driven by a version of the clock signal that has passed throughpulse generation circuitry 268.

The strengths of inverters 266 and 270 are preferably such that inverter266 is weaker than inverter 270. The relative strengths of inverters 266and 270 may be adjusted by selecting appropriate transistor gate widthsfor the n-channel and p-channel metal-oxide-semiconductor transistors ininverters 266 and 270. For example, inverter 270 may be configured to bestronger than inverter 266 by forming inverter 270 from transistors withlarger gate widths than the transistors used in inverter 266.

Inverter 266 serves as a weak sustainer that weakly drives a clock pathto its desired value. Because of the presence of timing circuitry 268,inverter 270 exhibits a slow turn on characteristic and a fast turn offcharacteristic. As a result, during both rising and falling clocktransitions, one inverter transistor is turned off before the otherinverter transistor is turned on. This temporarily weakens the strengthof the clock driver 260 and prevents contention with a correspondingclock driver that is driving nominally identical clock signals onto theother end of a shorted clock path. A timing diagram showing therelationship between signal CLK on input line 272, signal ng (the signalon the gate of transistor TN), and pg (the signal on the gate oftransistor TP) is shown in FIG. 18.

The example of FIG. 14 is merely illustrative. In general, any suitablelow-contention clock drivers may be used when driving clocks onto clockpaths such as paths 242 and 238 of FIG. 13. When two low-contentiondrivers are connected to a single path, but are driven by inputs withslightly different delays, the driver that first receives the clocksignal will tend to drive the path to the new value while the driverthat last receives the clock signal will still be disabled for a shorttime. This avoids contention currents between the two drivers.

As described in connection with FIG. 10, pulse latch robustness isadversely affected by race through. Race through conditions arise whenthe time from clock to out (Tco) in a source latch and the propagationdelay through any intervening combinational logic is smaller than therequired amount of time that the data must be stable after the clock ata destination latch, referred to as hold time (Th). There is thus a racethrough problem when Tco+Tpd<Th. This problem may be addressed either byincreasing the magnitude of Tco or decreasing the magnitude of Th. Indoing so in a pulse latch configuration, care should be taken topreserve the high speed of the pulse latches. This may be accomplishedby allowing a logic designer to selectively increase Tco only when thereis a danger of race through.

In accordance with embodiments of the present invention, Tco may beincreased by selectively adding a secondary latch circuit (e.g., byselectively adding a negative phase latch on the output path of a givenpulse latch). An arrangement of this type is shown as pulse latchcircuit 276 of FIG. 15. As shown in FIG. 15, latch circuit 276 may havelevel sensitive latch 164 and a secondary selectively enabled latch L2.With one suitable arrangement, circuit 276 of FIG. 15 may be connectedin parallel with circuit 86 of FIG. 11 (i.e., a circuit 276 and acircuit 86 may both be connected to the same logic outputs such as acommon look-up table output and a common sdata line). In this type ofarrangement, a given logic element may have two distinct outputs, one ofwhich is associated with a latch circuit of the type shown in FIG. 15and one of which is associated with the latch circuit of the type shownin FIG. 11. If desired, circuits such as the circuits of FIGS. 11 and 15may be used separately on an integrated circuit.

Circuit 276 may be configured as either a pulse latch (by disablinglatch L2 by appropriate programming of device 10) or an edge-triggeredflip-flop (by enabling latch L2 by appropriate programming of device10). The state of pass transistor T1 may be controlled by the output ofa corresponding programmable memory element 20. By loading suitableconfiguration data into the memory element, transistor T1 may be turnedoff or on as desired.

Circuit 276 may be operated in three different modes.

In a first mode, circuit 276 may be configured to operate as a pulselatch. In this mode, control signal CA will be high, control signal CBwill be low, and transistor T1 will be turned off. This disables latchL2. Output signals from latch 164 will be routed through output buffer166 to output line 158 without being influenced by the latchingoperation of latch circuit L2.

In a second mode, circuit 276 may be operated as an edge triggered flipflop by enabling latch L2. In this mode, control signal CA will beinverted clock pulse NCP, control signal CB will be clock pulse CP, andpass gate T1 will be off. Because there are two active latches incircuit 276 in this configuration, circuit 276 serves as an edgetriggered flip flop and eliminates the risk of race through conditionsdeveloping on the integrated circuit.

In a third mode, circuit 276 operates as a combinatorial circuit. Inthis situation, latch L2 is disabled by taking control signals CA and CBlow. Transistor T1 is turned on to bypass latch L1. In thisconfiguration, combinatorial signals from input line 156 may be routedto output 158.

If desired, selectively enabled secondary latching circuitry may beadded to circuitry 276 at a location that is upstream from latch 164,rather than downstream. This type of configuration is shown in FIG. 16.As shown in FIG. 16, latch circuitry 276 may have secondary latchcircuit 292 in circuitry EL2, which serves as a secondary negative phaselatch when it is desired to reduce the risk of race through conditions.Latch circuit 292 may be based on a level sensitive latch with aninverting clock pulse input. A separate pass gate 290 can be provided tocouple the output of latch 292 into the input of pulse latch 164. Aswith latch circuitry 276 of FIG. 15, latch circuitry 276 of FIG. 16 maybe operated in three modes.

In a first mode, circuit 276 operates as a pulse latch. In this mode ofoperation, clock pulse CP may be used as pass gate control signal C1 andinverted clock pulse NCP may be used as pass gate control signal C2 inlatch 164. The control signal C3 for pass gate 290 may be set to a lowvalue, so that latch EL2 is disabled. Pass transistor T1 may be turnedoff, so that signals are routed through latch 164 to output 158.

In a second mode, circuit 276 operates as an edge-triggered flip flop.In this configuration, pass transistor R1 may be turned off. Controlsignal C1 may be set to a low value to disable pass gate 170. Controlsignal C2 may be provided with inverted clock pulse NCP and controlsignal C3 may be provided with clock pulse CP. This enables latch 292,so that signals are routed from input 156 (or input 160) through bothEL2 and latch 164 to output 158. Because circuit 276 operates as anedge-triggered flip flop in this mode, the risk of race throughconditions is eliminated.

In the third mode, latch circuit 276 is configured to operate as acombinatorial circuit. In this mode, control signals C1, C2, and C3 arelow to disable latch EL2 and latch 164. Transistor T1 is turned on, sothat combinatorial signals from input 156 may bypass both latches andmay pass to output 158 through buffer 166.

The foregoing is merely illustrative of the principles of this inventionand various modifications can be made by those skilled in the artwithout departing from the scope and spirit of the invention.

What is claimed is:
 1. Circuitry on an integrated circuit, comprising: apulse latch formed from a level sensitive latch that receives at leastone clock pulse of a given pulse width; and pulse generation circuitrythat generates the at least one clock pulse, wherein the pulsegeneration circuitry includes an input that receives a clock signal, anoutput at which the clock pulse is provided, and logic circuitry withwhich signals from the output are fed back and combined with the clocksignal, wherein the given pulse width comprises an adjustable pulsewidth, and wherein the pulse generation circuitry includes aconfigurable delay element.
 2. The circuitry defined in claim 1, whereinthe logic circuitry includes a NAND gate.
 3. Circuitry on an integratedcircuit, comprising: a pulse latch formed from a level sensitive latchthat receives at least one clock pulse of a given width; and pulsegeneration circuitry that generates the at least one clock pulse,wherein the pulse generation circuitry includes an input that receives aclock signal, an output at which the clock pulse is provided, and logiccircuitry with which signals from the output are fed back and combinedwith the clock signal, wherein the logic circuitry includes a feedbackpath and at least one NAND gate in the feedback path and wherein thepulse generation circuitry includes a configurable delay element.
 4. Thecircuitry defined in claim 3, wherein the given pulse width comprise anadjustable pulse width.
 5. The circuitry defined in claim 3, furthercomprising at least one programmable element, wherein the at least oneprogrammable element provides a static control signal to theconfigurable delay element.
 6. Circuitry on an integrated circuit,comprising: a pulse latch formed from a level sensitive latch thatreceives at least one clock pulse of a given width; and pulse generationcircuitry that generates the at least one clock pulse, wherein the pulsegeneration circuitry includes an input that receives a clock signal, anoutput at which the clock pulse is provided, and logic circuitry withwhich signals from the output are fed back and combined with the clocksignal, wherein the logic circuitry comprises a feedback path coupled tothe output, a delay element, and a configurable multiplexer thatselectively includes the delay element in the feedback path.
 7. Thecircuitry defined in claim 6, wherein the delay element comprises aplurality of series-connected inverters.
 8. The circuitry defined inclaim 6, further comprising at least one programmable element thatprovides a static control signal to the configurable multiplexer. 9.Circuitry on an integrated circuit, comprising: a pulse latch formedfrom a level sensitive latch that receives at least one clock pulse of agiven width; and pulse generation circuitry that generates the at leastone clock pulse, wherein the pulse generation circuitry includes aninput that receives a clock signal, an output at which the clock pulseis provided, and logic circuitry with which signals from the output arefed back and combined with the clock signal, wherein the pulsegeneration circuitry includes a NAND gate that receives the clock signaland the fed back signals from the output, wherein the pulse generationcircuitry includes a configurable delay element having at least twodelay settings.
 10. The circuitry defined in claim 9 wherein the clockpulse has a pulse width, wherein the fed back signals are fed back alonga feedback path, and wherein the circuitry includes a NAND gate in thefeedback path that serves at least partly to establish the pulse width.11. The circuitry defined in claim 9 wherein the clock pulse has a pulsewidth, wherein the fed back signals are fed back along a feedback path,wherein the circuitry includes a NAND gate in the feedback path, andwherein the configurable delay element comprises: a delay element; and afirst multiplexer that configurably connects the delay element into thefeedback path, wherein the NAND gate and the first multiplexer serve atleast partly to establish the pulse width when the delay element is notconnected into the feedback path.
 12. The circuitry defined in claim 11further comprising a second multiplexer in the feedback path, whereinthe second multiplexer has a control input that is controlled by theclock signal at the input.
 13. The circuitry defined in claim 9, whereinthe given pulse width comprises an adjustable pulse width.
 14. Thecircuitry defined in claim 9, wherein the configurable delay elementincludes a multiplexer.
 15. Circuitry on an integrated circuit,comprising: a pulse latch formed from a level sensitive latch thatreceives at least one clock pulse of a given pulse width; and pulsegeneration circuitry that generates the at least one clock pulse,wherein the pulse generation circuitry includes an input that receives aclock signal, an output at which the clock pulse is provided, and logiccircuitry with which signals from the output are fed back and combinedwith the clock signal, wherein the given pulse width comprises anadjustable pulse width, and wherein the logic circuitry includes atleast first and second NAND gates.
 16. The circuitry defined in claim15, wherein the first NAND gate receives the clock signal.
 17. Thecircuitry defined in claim 16, further comprising a configurable delayelement.
 18. Circuitry on an integrated circuit, comprising: a pulselatch formed from a level sensitive latch that receives at least oneclock pulse of a given pulse width; and pulse generation circuitry thatgenerates the at least one clock pulse, wherein the pulse generationcircuitry includes an input that receives a clock signal, an output atwhich the clock pulse is provided, and logic circuitry with whichsignals from the output are fed back and combined with the clock signal,wherein the given pulse width comprises an adjustable pulse width, andwherein the fed back signals are fed back along a feedback path, andwherein the circuitry further comprises: a first NAND gate that receivesthe clock signal; and a second NAND gate in the feedback path.